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Abstract 

This paper presents PROVERB a text planner for 
argumentative texts. PROVERBs main feature is 
that it combines global hierarchical planning and un- 
planned organization of text with respect to local 
derivation relations in a complementary way. The for- 
mer splits the task of presenting a particular proof into 
subtasks of presenting subproofs. The latter simulates 
how the next intermediate conclusion to be presented 
is chosen under the guidance of the local focus. 

1. Introduction 



This paper presents a text planner for the verbaliza- 
tion of natural deduction (ND) style proofs ]Gen35 |. 
Several similar attempts can be found in previous 
work. Developed before the era of NL gen eration, 
the system EXPOUND of D. Chester JChe76( | can be 
characterized as an example of direct translation: Al- 
though a sophisticated linearization is applied on the 
input ND proofs, the steps are then translated locally 
in a template driven way. ND proofs were tested as 
input to an ea rly vers ion of the MUMBLE system of 
D. McDonald [McD83|, the main aim however, was to 
show the feasibility of the architecture. A mor e recent 
attempt can be found in THINKER [EP93fl , which 
implements several interesting but isolated proof pre- 
sentation strategies, without giving a comprehensive 
underlying model. 

Our computational model can therefore be viewed 
as the first serious attempt at a comprehensive compu- 
tational model that produces adequate argumentative 
texts from ND style proofs. The main aim is to show 
how existing text planning techniques can be adapted 
for this particular application. To test its feasibility, 
this computational model is implemented in a system 
called PROVERB. 

Most current NL text planners assume that lan- 
guage generation is planned behavior and there- 



forc adopt a hierarch ical planning approach [ Hov8S , 
Moo89, Dal92, Rei91]. Nonetheless there is psycholog- 
ical evidence that lan guage h as an unplanned, sponta- 
neous aspect as well [ pch79 |. Based on this observa- 
tion, researchers have exploited organiz ing te xt with 
respect to some local relations. Sibun [[5ib9C | imple- 
mented a system generating descriptions for objects 
with a strong domain structure, such as houses, chips 
and families. Once a discourse is started, local struc- 
tures suggest the next objects available. Instead of 
planning globally, short-range strategies are employed 



to organize a short segment of text. From a computa- 
tional point of view, a hierarchical planner elaborates 
recursively on the initial communicative goal until the 
final subgoals can be achieved by applying a primitive 
operator. A text generator based on the local orga- 
nization, in contrast, repeatedly chooses a part of the 
remaining task and carries it out. 

The macroplanner of PROVERB combines hierar- 
chical planning with local organization in a uniform 
planning framework. The hierarchical planning is re- 
alized by so-called top-down presentation operators 
that split the task of presenting a particular proof 
into subtasks of presenting subproofs. While the over- 
all planning m echanism follow s the RST-based plan- 
ning approach [ Moo8£ , Rei91 ] , the planning operators 
more clos ely resemble the schemata in schema-based 



planning [McK85, Par88 . Bottom- up presentation 
operators are devised to simulate the unplanned as- 
pect, where the next intermediate conclusion to be 
presented is chosen under the guidance of the local 
focus mechanism in a more spontaneous way. Since 
top-down operators embody explicit communicative 
norms, they are always given a higher priority. Only 
when no top-down presentation operator is applicable, 
will a bottom-up presentation operator be chosen. 

This distinction between planned and unplanned 
presentation leads to a very natural segmentation of 
the discourse into an attentional hierarchy , since, fol- 
lowing the theory of Grosz and Sidner ]GS8(: ] , there is 
a one-to-one correspondence between the intentional 
hierarchy and the attentional hierarchy. This atten- 
tional hierarchy is used to make reference choices for 
inference methods and for previously presented inter- 
mediate conclusions. The inference choices are the 
main con cern of the microplanner of PROVERB (see 
[ |Hua94b(l ). 



2. Context of Our Research 

The text planner discussed in this paper is 
the macroplanner of PROVERB , which translates 
machine-found proofs in several steps into natural 
language. PROVERB adopts a reconstructive ap- 
proach: Once a proof in a machine oriented for- 
malism is generated in the proof development envi- 
ronment fi-MKRP, a new proof that more resem- 
bles those found in mathematical textbooks is recon- 



structed | Hua94a . The reconstructed proof is a proof 
tree, where proof nodes are derived from their chil- 
dren by applying an inference method (also called a 
justification). Most of the steps are justified by the 
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Figure 1: An Example Input Proof 



application of a definition or a theorem, the rest are 
justified by inference rules of the natural deduction 
(ND) calculus, such as the "Case" rule. Figure [l] is an 
example of a segment of a possible input proof, where 
some nodes are labeled for convenience. 
The justifications "Du" , "Dsubgr" , "Ds" , "Dg" , and 
"Tsol" stand for the definitions of unit element, of 
subgroup, of subset, of group, and the theorem about 
solution, respectively. 

The input proof tree is also augmented with an or- 
dered list of nodes, being roots of subproofs planned 
in this order. The proof in Figure |l| is associated with 
the list: ([2], [3], [4], [1]). 

3. The Framework of the Macroplanner 

The macroplanner of PROVERB elaborates on com- 
municative goals, selects and orders pieces of informa- 
tion to fulfill these goals. The output is an ordered se- 
quence of proof communicative act intentions (PC As). 
PCAs can be viewed as speech acts in our domain of 
application. 

Planning Framework 

PROVERB combines the two above mentioned pre- 
sentation modes by encoding communication knowl- 
edge for both top-down planning and bottom-up pre- 
sentation in form of operators in a uniform planning 
framework. Since top-down presentation operators 
embody explicit communicative norms, they are given 
a higher priority. A bottom-up presentation is cho- 
sen only when no top-down presentation operator ap- 
plies. The overall planning framework is realized by 
the function Present. Taking as input a subproof, 
Present repeatedly executes a basic planning cycle 
until the input subproof is conveyed. Each cycle car- 
ries out one presentation operator, where Present al- 
ways tries first to choose and apply a top-down op- 
erator. If impossible, a bottom-up operator will be 
chosen. The function Present is first called with the 
entire proof as the presentation task. The execution of 
a top-down presentation operator may generate sub- 
tasks by calling it recursively. The discourse produced 
by each call to Present forms an attentional unit 
(compare the subsection below). 

The Discourse Model and the Attentional Hi- 
erarchy 

The discourse carried out so far is recorded in a dis- 
course model. Rather than recording the semantic ob- 
jects and their properties, our discourse model consists 
basically of the part of the input proof tree which has 



already been conveyed. The discourse model is also 
segmented into an attentional hierarchy, where sub- 
proofs posted by a top-down presentation operators 
as subtasks constitute attentional units. The follow- 
ing are some notions useful for the formulation of the 
presentation operators: 

• Task is the subproof in the input proof whose 
presentation is the current task. 

• Local focus is the intermediate conclusion last 
presented, while the semantic objects involved in 
the local focus are called the focal centers. 

Proof Communicative Acts 

PCAs are the primitive actions planned during the 
macroplanning to achieve communicative goals. Like 
speech acts, PCAs can be defined in terms of the com- 
municative goals they fulfill as well as their possible 
verbalizations. Based on an analysis of proofs in math- 
ematical textbooks, each PC A has as goal a combina- 
tion of the following subgoals: 

1. Conveying a step of the derivation. The simplest 
PCA is the operator Derive. Instantiated as be- 
low: 

(Derive Reasons: (a G Si, Si C 5 2 ) 
Intermediate-Results: nil 
Derived-Formula: a G S2 
Method: def -subset) 

depending on the reference choices, a possible ver- 
balization is given as following: 

"Because a is an element of Si and Si 
is a subset of S2 , according to the defi- 
nition of subset, a is an element of 52." 

2. Updates of the global attentional structure. 
These PCAs sometimes also convey a partial plan 
for the further presentation. Effects of this group 
of PCAs include: creating new attentional units, 
setting up partially premises and the goal of a 
new unit, closing the current unit, or reallocating 
the attention of the reader from one attentional 
unit to another. The PCA 

(Begin-Cases Goal: Formula 

Assumptions: (A B)) 

creates two attentional units with A and B as the 
assumptions, and Formula as the goal by produc- 
ing the verbalization: 



"To prove Formula, let us consider the 
two cases by assuming A and B." 



Thirteen 



PCAs 



are 

currently employed in PROVERB. See [Hua94b] for 
more details. 

Structure of the Planning Operators 

Although top-down and bottom-up presentation ac- 
tivities are of a conceptually different nature, the cor- 
responding communication knowledge is uniformly en- 
coded as presentation operators in a planning frame- 
work, similar to the plan operators in other generation 
systems pov8Sj , |Moo89| , pa!92| , |Rei91[ |. In general, 
presentation operators map an original presentation 
task into a sequence of subtasks and finally into a se- 
quence of PCAs. All of them have the following four 
slots: 

• Proof, a proof schema, which characterizes the 
syntactical structure of a proof segment for which 
this operator is designed. It plays the role of the 
goal slot in the traditional planning framework. 

• Applicability Condition: a predicate. 

• Acts: a procedure which essentially carries out 
a sequence of presentation acts. They are either 
primitive PCAs, or are recursive calls to the pro- 
cedure Present for subproofs. 

• Features: a list of features which helps to select 
the best of a set of applicable operators. 



4. Top-Down Planning 

This section elaborates on the communicative norms 
concerning how a proof to be presented can be split 
into subproofs, as well as how the hierarchically- 
structured subproofs can be mapped onto some linear 
order for presentation. In contrast with operators em- 
ployed in RST-based planners that split goals accord- 
ing to the rhetorical structures, our operators encode 
standard schemata for presenting proofs, which con- 
tain subgoals. The top-down presentation operators 
are roughly divided into two categories: 

• schemata-based operators encoding complex 
schemata for the presentation of proofs of a spe- 
cific pattern (twelve of them are currently inte- 
grated in PROVERB), 

• general operators embodying general presenta- 
tion norms, concerning splitting proofs and or- 
dering subgoals. 
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Figure 2: A Schema Involving Cases 
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Let us first look at an operator devised for proof seg- 
ments containing cases. The corresponding schema of 
such a proof tree is shown in Figure || Under two 
circumstances a writer may recognize that he is con- 
fronted with a proof segment containing cases. First, 
when the subproof that has the structure of Figure J2 
is the current presentation task, tested by (task ?£i)q 
Second, when the disjunction F\/ G has just been pre- 
sented in the bottom-up mode, tested by (local-focus 
7^4). Under both circumstances, a communication 
norm motivates the writer to first present the part 
leading toFVG (in the second case this subgoal has 
already been achieved), and then to proceed with the 
two cases. It enforces also that certain PCAs be used 
to mediate between parts of proofs. This procedure is 
exactly captured by the presentation operator below. 
Case-Implicit 

• Proof: as given in Figure |^ 

• Applicability Condition: ((task ?Zi) V 
(local- focus 7L4)) A (not-conveyed {IL2 ?^3)) 



• Acts: 



1. if ?L4 has not been conveyed, then present 
?L 4 (subgoal 1) 

2. a PCA with the verbalization: "First, let us 
consider the first case by assuming F." 

3. present IL2 (subgoal 2) 

4. a PCA with the verbalization: "Next, we 
consider the second case by assuming 67." 

5. present ?L 3 (subgoal 3) 

6. mark ?Li as conveyed 

• features: (top-down compulsory implicit) 

The feature values can be divided into two groups: 
those characterizing the style of the text this opera- 
tor produces, and those concerning other planning as- 
pects. "Implicit" is a stylistic feature value, indicating 
that the splitting of the proof into the three subgoals is 
not made explicit. In its explicit dual Case-Explicit 
a PCA is added to the beginning of the Acts slot, 
which produces the verbalization: 

"To prove Q, let us first prove F V G, and 
consider the two cases separately." 

The feature value "compulsory" indicates that if the 
applicability condition is satisfied, and the style of the 
operator conforms to the global style the text planner 
is committed to, this operator should be chosen. Two 
weaker values also reflect the specificity of plan oper- 
ators: "specific" and "general" . 

General presentation operators perform a simple 
task according to some general text organization prin- 
ciples. They either 

• enforce a linearization on subproofs to be pre- 
sented, or 



1 Labels stand for the corresponding nodes 



• split the task of the presentation of a proof with 
ordered subproofs into subtasks. 

The first ordering operator operationalizes a gen- 
eral ordering strategy called minimal load principle. 
This principle predicates that a writer usually presents 
shorter branches before longer ones. The argument of 
Levelt is rather simple: When one branch is chosen to 
be described first, the writer has to have the choice 
node flagged in his memory for return. If he follows 
the shorter branch first, the duration of the load will 
be shorter. The concrete operator is omitted. 

Note that, the subproofs being ordered are sub- 
proofs conceptually planned while the corresponding 
proof is constructed. There are two other ordering 
operators based on general ordering principles: the lo- 
cal focus principle and the proof time order principle 
| Hua94b| . 

The invocation of an ordering operator is always 
followed by the invocation of a splitting operator, 
which actually posts subgoals by calling the function 
Present with the ordered goals subsequently. 

5. Bottom-up Presentation 

The bottom-up presentation process simulates the un- 
planned part of proof presentation. Instead of split- 
ting presentation goals into subgoals according to 
standard schemata, it follows the local derivation re- 
lation to find a next proof node or subproof to be pre- 
sented. In this sense, it is simila r to the local organi- 
zation techniques used in |5ib9Cfl . When no top-down 
presentation operator applies, PROVERB chooses a 
bottom-up operator. 

The Local Focus 

The node to be presented next is suggested by the 
mechanism of local focus. Although logically any proof 
node having the local focus as a child could be chosen 
for the next step, usually the one with the greatest se- 
mantic overlapping with the focal centers is preferred. 
As mentioned above, focal centers are semantic ob- 
jects mentioned in the proof node which is the local 
focus. This is based on the observation that if one 
has proved a property about some semantic objects, 
one tends to continue to talk about these particular 
objects before turning to new objects. Let us examine 
the situation when the proof below is awaiting presen- 
tation. 
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: P(a,b) 


[1] : P(a,b) 7 [3] : 5(c) 


2 


: Q(a,bY 


4 : R(b,c) 



[5] : Q{a, b) A R{b, c) 

Assume that node [1] is the local focus, the set {a, b} 
are the focal centers, [3] is a previously presented 
node and node [5] is the current task. [2] is chosen 
as the next node to be presented, since it does not 
(re)introduce any new semantic object and its overlap 
with the focal centers ({a, b}) is larger than those of 
[4] ({&}). 



The Bottom-Up Presentation Operators 

Under different circumstances the derivation of the 
next-node is also presented in different ways. The 
corresponding presentation knowledge is encoded as 
bottom-up presentation operators. The one most fre- 
quently used presents one step of derivation: 

Derive-Bottom-Up 

. p roof: INoder Wode n ?M 
!Node n+1 

• Applicability Condition: ?Node n +i is suggested 
by the focus mechanism as the next node, and 
INodei, . . ., lNode n are conveyed. 

• Acts: a PCA that conveys the fact that lNode n +i 
is derived from the premises !Node\ 1 . . ., lNode n 
by applying ?M. 

• Features: (bottom-up general explicit detailed) 

If the conclusion ? Node n+ \, the premises and the 
method 1M are instantiated to a G 5i, (a G 52, 
Si G 52), def-subset respectively, the following ver- 
balization can be produced: 

"Since a is an element of 5i, and 5i is a 
subset of 52 , a is an element of 52 according 
to the definition of subset." 

A trivial subproof may be presented as a single 
derivation by omitting the intermediate nodes. This 
next subproof is also suggested by the local focus. 
This is simulated by a bottom-up operator called 
Simplify-Bottom-Up. Currently seven bottom- up 
operators are integrated in PROVERB . 

6. Verbalization of PCAs 

Macroplanning produces a sequence of PCAs. Our 
microplanner is restricted to the treatment of the ref- 
erence choices for the inference methods and for the 
previously presented intermediate conclusions. While 
the former depends on static salience relating to the 
domain knowledge, the latter is similar to subsequent 
references, and is therefore sensitive to the context, in 
particular to its segmentation into attentional hierar- 
chy. Due to space restrictions, we only show the fol- 
lowing piece of a preverbal message as an example, be- 
ing a PCA enriched with reference choices for reason s 
and method by the microplanner [ Hua94b , Hua94b| . 



(Derive Reasons: (((ELE a U) explicit) 
((SUBSET U F) omit)) 
Conclusion: (ELE a F) 
Method: (Def -Subset omit)) 



Our surface generator TAG-GEN [ Kil94 | produces 
the utterance: 

"Since a is an element of U, a is an element 
of F." 



Notice, only the reason labeled as "explicit" is ver- 
balized. 

Finally, to demonstrate the type of proofs currently 
generated by PROVERB , below is the complete out- 
put for a proof constructed by O-MKRP: 

Theorem: Let F be a group and U a subgroup of F, 
if 1 and are unit elements of F and U respectively, 
then 1 = ljj. 
Proof: 

Let F be a group, U be a subgroup of F, 1 be a 
unit element of F and lu be a unit element of U. 
According to the definition of unit element, lu G U. 
Therefore there is an X, X G U. Now suppose that 
U\ is such an X . According to the definition of unit 
element, u\ * lu = u\. Since U is a subgroup of F, 
U C F. Therefore lu G F. Similarly u\ G F, since 
Mi G U . Since F is a group, F is a semigroup. Because 
Ui = 11%, lu is a solution of the equation ui * X = 
u\. Since 1 is a unit element of F, Ui * 1 = u±. Since 
1 is a unit element of F, 1 € F. Because u± G F, 1 
is a solution of the equation u± * X — u±. Since F is 
a group, lu = 1 by the uniqueness of solution. This 
conclusion is independent of the choice of the element 

m. 

7. Conclusion and Future Work 

This paper puts forward an architecture that combines 
several established NL generation techniques adapted 
for a particular application, namely the presentation 
of ND style proofs. We hope that this architecture is 
also of general interest beyond this particular applica- 
tion. 

The most important feature of this model is that 
hierarchical planning and unplanned spontaneous pre- 
sentation are integrated in a uniform framework. Top- 
down hierarchical planning views language generation 
as planned behavior. Based on explicit communicative 
knowledge encoded as schemata, hierarchical planning 
splits a presentation task into subtasks. Although our 
overall presentation mechanism has much in common 
with that of RST-based text planners, the top-down 
planning operators contain mostly complex presenta- 
tion schemata, like those in schema-based planning. 
Since schemata-based planning covers only proofs of 
some particular structure, it is complemented by a 
mechanism called bottom-up presentation. Bottom- 
up presentation aims at simulating the unplanned part 
of proof presentation, where a proof node or a sub- 
proof awaiting presentation is chosen as the next to 
be presented via the local derivation relations. Since 
more than one such node is often available, the local 
focus mechanism is employed to single out the can- 
didate having the strongest semantic links with the 
focal centers. The distinction between planned and 
unplanned behavior enables a very natural segmen- 
tation of the discourse into an attentional hierarchy. 
This provide an appropriate basis for a discourse the- 
ory which handles reference choices [ Hua94t| . 

Compared with proofs found in mathematical text- 
books, the output of PROVERB is still too tedious 



and inflexible. The tediousness is largely ascribed to 
the lack of plan level knowledge of the input proofs, 
which distinguishes crucial steps from unimportant 
details. Therefore, sophisticated plan recognition 
techniques are necessary. The inflexibility of text cur- 
rently produced is partly inherited from the schemata- 
based approach, for which a fine-grained planning in 
terms of single PCAs might be a remedy. It is also 
partly due to the fixed lexicon choice, which we are 
currently reimplementing. 
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